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Introduction 

Historically, Indian society has never paid enough attention to documenting or archiving things. 
Similarly, film industry has also suffered from the same indifference towards archiving. With 
desire to get information about Hindi film songs Hamraaz painstakingly created a set of 
documents on Hindi films/songs called Geetkosh. My effort would not have been possible 
without the Geetkosh. Moreover he has provided many updates in correspondence. Issue with 
Geetkosh information was that without listening to songs, some of the information was not 
available, was incorrect or could not be verified. I have been forwarding major corrections to 
Hamraaz. 

My effort has been to accurately capture song information that I have heard and more accurate 
way listing singers. I also tried to normalize names of artists. Arun Deshmukh has been most 
helpful in that regard. Information is embedded with songs (using Id3 tags) and kept in a music 
library program, Media Jukebox . (Media Jukebox is now used in Amazon music and is available 
free at the link) Special effort was placed on mukhda accuracy. About 33% of songs have 
mukhda updates (beyond appendix). Initially the library contained only the songs in my possession. 
Every time I found a new song, Geetkosh was referred and the book started coming apart. To minimize 
opening the book, all songs of decade of 50s were added to the song library by creating a dummy one 
second file for songs I did not have. However, I do not intend do same for any other decade. 

Geetkosh captures lot of information but my effort has been song centric. Therefore only part of 
Geetkosh information was retained in the song library. Items such as producer, director, record 
number and censor information were of no relevance to me. I have added some additional 
information fields that were helpful in doing my live radio show. 

Media Jukebox keeps all information in an internal indexed database. Indexing allows for fast 
search on complex queries. Database is independent of actual songs and I will be sharing the 
database. Use of database would require download of Media Jukebox. For those who don’t want 
that, I am making available dump of database (fields marked with *) in a separate document. 
Below are details of library fields and my comments on them. 

I will be making decades of 30s and 40s available once information has been meticulously 
checked out and corrected. 

In spite of my effort, there are bound to be something missing or incorrect. Please point out. 

Information Fields in the Library 

Year: Year of film release. For unreleased films, year is assigned based on crude approximation 
from record number. Library does not accept alpha characters hence 195x could not be used. 



Film Name: is typically how it is found in movie credits (if available), Cassette, CDs or LPs or 
record. In few cases certificate name was found to be different than in film credits. Credit name 
was used in such case. For films listed in Geetkosh appendix, word unreleased is appended to the 
film name. 

Track: Same as number in Geetkosh. For additional songs, new number is assigned. For 
multiple song versions (video and record) same number was used. 

Name: is the mukhda. It is words in the first lines of a song. Most of the sources take liberty in 
listing words of mukhda. Mukhda listed in library is exactly how the song is heard in terms of 
accurate count of word or words repeated, alaap/humming or extraneous words in the beginning 
or in middle. Better accuracy was obtained by listening to mukhda in an editor while going over 
words again and again. Another thing which helped in accuracy is keeping best quality audio file 
in library if song was available from many sources. 

Since mukhda is considered title, first letter of each word is capitalized. 

Many times alaap or other words do not add anything to the song hence it has been placed at the 
end of mukhda under brackets []. It also helps in search. If word added to the meaning of the 
mukhda, it was left at the beginning, for example words such as Hoye, Are, Ho, O, etc. 

Word repeat count is a number how many times word is repeated. For example Jhuk Jhuk Jhuk 
would be written as Jhuk 3 (without dash). For set of words which are repeated, counted is 
preceded by a dash. Set of words are either from the beginning of mukhda or from previous 
comma. 

Example 1: Aaj Ki Raat Piya-2 (song starts with Aaj) 

Example 2: Duniya Badal Rahi Hai, Aansoo Bahane Wale-2 (set of words repeated are Aansoo 
Bahane Wale) 

In some cases, segment of mukhda listed Geetkosh is in fact part of antara or it is missing from 
mukhda, I have dropped that segment from mukhda. It is noted in Notes section. 

Some subtle discrepancies identified are use of Pe vs Par , Meri vs Mori, Sajan vs Saajan, etc 

In many cases, video version is different than record version. If both are available and are 
different, both are listed in the library. Video version is marked with (video) at the end of 
mukhda. Any changes are captured in Information Updates column. Adding of alaap/humming 
is not designated as Mukhda Update. 

*File name; format is <Year>_<Film Name>_<Track #>-<mukhda>[{ video)] 

For unreleased films, year was designated as 195x. 

*Singer(s): 

Singer names are normalized. For multiple singers, they are listed in the order they have sung 
including chorus (sathi). For example, if singers are listed as Fata-Rafi and in song it is heard as 
Rafi-Fata. It is listed appropriately and marked as Singers Reordered in Information Updates 
column. If song starts with chorus then chorus is listed first but this is not designated as Singers 
Reordered. If singer could not be identified with absolute confidence, it is listed as female or 
male resisting temptation to fill in with guesswork. If singer is not known and song is not 



available, then it is listed as unknown. If song contains many non familiar singers whose voice 
could not be recognized, singer names were not reordered. 


*Composer: 

Composer names are normalized. In some cases In case of multiple composers for a movie where 
name of composer is not known for a song, all names are listed with text “or” separating them. If 
composer is not known, then it is designated as unknown. 

^Lyricist: 

Lyricist names are normalized. In case of multiple lyricists for a movie and name of lyricist is 
not known, all names are listed with text “or” separating them. If lyricist is not known, then it is 
designated as unknown. 

Film Cast: 

Artist names normalized. It is an ongoing effort. 

*Filmed on: 

If video of a song is available and artists could be recognized, they are listed. For an 
unrecognized face, “??” is listed. Since I have not seen available movies, it is an ongoing effort. 
Any help will be appreciated as it would capture information for history. Surjit Singh has been 
most helpful in this area. 

^Information Updates 

This field shows what information about the song has changed. Information updated for a song 
could be any combination (comma separated) of following items. 

Lyricist Update 
MD Update 

Mukhda Update - Deviation from the Geetkosh (including appendix) 

Singer Update - If additional singer voice heard or listed singer not heard 
Singers Reordered 

* Notes: 

This is a free format field where some relevant information for a song is listed. Information 
could be Extra song, No Record Issued, Bad audio. Segment “<list of words > ” not part of 
mukhda, note about year estimation for a song from unreleased film, first film of artist, etc. 

* Duration: 

Song duration is minutes and seconds. Duration of 1 sec implies I don’t have the song. Duration 
is also impacted by speed, how much prelude song had and keeping the most complete song 
available from various sources. 

Prelude: 

This is the time in seconds of prelude music before words start. This was crucial to my radio 
show. I wanted to not play prelude music and start song with words and was difficult in a live 
show. With availability of prelude time, I would start the song but on zero volume and start 



speaking about the song with eye on song marker. As soon as prelude time was up, volume 
control for song would be moved up. Few observant listeners wondered about it when their 
request was filled in real time and without prelude music. Songs where prelude music is crucial 
to the song, it was played with prelude music. For songs where there is significant gap between 
alaap and start of mukhda, both times are captured with a dash separating them. This allowed me 
to decide whether play alap and ignore it. 

In many cases all or part of prelude is missing because some collectors would start playing the 
song and then push record button. 

Genre: I have used it liberally to create my own tags. Impetus was to use song classification for 
ease in doing live radio show. Some values used are Qawwali, Mujra, Patriotic, Philosophical, 
Lori, Birthday, Holi, Diwali, Rakhi, etc. 

Source is for my personal use so I know source of the song. 



